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HON CONTROL SYSTEM FOR ATM COMMUNICATIONS 

igestion elimination control „ 



system for an ATM commun cations network in which 
traffic of undesired connect! on is restricted so as to 
eliminate congestion by refe encing connection 
information when the congestion takes place in an ATM 
exchange. 

CONSTITUTION: When congestion takes place in ATM 



exchanges 1, 2, 3 in an ATM 



communications network a. 




a network management equii >ment 4 receiving the notice 
of occurrence of the conges bion via logic 
communication-5' circuits 9, 10, 11 references 
connection attribute informs tion stored in a connection 
attribute table 5 provided to the network management 
equipment 4. Then any of cc nnection sets A-D set in a 

management area p of the network management r w A txa 

equipment 4 is selected corresponding to the referenced ~ — — — t ■ / 
connection attribute information and gives a command 
abort of part or all of the connection sets A-D selected 
through logic communications circuits 9, 10, 11 to the 

ATM exchanges 1 , 2 r 3 via tl le selected connection sets A~D. Thus, the traffic of the 
connection sets A-D is restricted to eliminate the congestion caused in the ATM exchanges 1, 
2. 3. 
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Image Coding Using Zerotrees of Wavelet 
Coefficients 

Jerome M. Shapiro 



steroiree wavelet algoriiW tEfcW) 



s ■ simple, yd rcmaj-kab). effective, Image compression algo- 
rithm, baWnt Ibt proper*: Wis In Ibe bit stream arc 
generated lb order of iiof iruncc, yielding * fully embedded 
code. Tbt embedded code ^presents a sequence of binary dc- 
cisloiu that distinguish. «n 1 nage Iron ibe "nnU" Image. Usro R 
an embedded coding *h*er thrp, «a encoder can icrmlnale the 
escodtni *| nay poim Iher rhy allowing » targcl rate or target 
distortion metric » be &n ex>cUy. Also, given p brt stream, 
the decoder od cease dec* iinff at mj point in tbc fail Slrtaux 
BPd ftin produce exactly li e same image *hai would have been 
encoded at (be bit rale c »rrcspondI^ to Ibe truncated bft 
Arcazn. In addiikm la proj uclne > '*dly embedded bit llrtanu 
compreafiioB rcsuhs ihat pre com- 
petitive wfcb virio-Dy «|| I uxrwit corapiegwD algorithms on 
standard lest Images. Ye* ■ ris performance fa achieved with » 
technique thai requires abialuleljf no training, mo prcxlored 
tables or code book*, and' flwjuires »o prior knowledge of Ibe 

an four key concepts: 1) a drv 
hierarchic*! subbaod deeotnposi- 
of Significant InroroiBCiox) 
the self-sUtiilaj-ily inherent In tm- 
ij>r«approzlniatioq <t*'pn*irHrftti. 
compnsmp wbtcb is achcered via 



Tbe E2W algorithm is 
crcte wavelet transform 
lion, 2) prediction of Ibe 
across scales by c*ploiUzi| 
ago, 3) entropy-coded n 
and 4) universal lossless da I 
adaptive arithmetic Coding 



I. !ntb,oduction >nd Problem Statement 

THIS paper addresses the two-fold problem of 3) ob- 
taining the best imag quality for a given bit rate, and 
2) accomplishing this tas c in an embedded fashion, i.e., 
in such a way thai all e >codings of the same image ax 
lower bit rates are embe ded in the beginning of the bit 
stream for The target bit i lie. 

The problem is import; m in many applications* partic- 
ularly for progressive trai emission, image browsing 125), 



digital hierarchy of muliij 
io transmission over a uo 



I LIU 
le 



ordering or the bits in ore sr of importance leads nalurally 
to prioritization for the purpose of \Bycrc6 protection 
schemes. 
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bit rates. It is also applicable 
sy channel in the sense that the 



A. Bmb€ddcd Coding 

Aj> embedded code represents a sequence of binary de- 
cisions thai distinguish an image from the "null," or all 
gray, image. Since, the embedded code contains ill lower 
rate codes "embedded" at the beginning of ihe bit stream, 
effectively, the bits Are "Ordered in importance." Using 
an embedded code, an encoder can terminate the encoding 
at any point thereby allowing a target rate or distortion 
metric to be raei exactly. Typically, some target param- 
eter, such as bit count, is monitored in the encoding pro- 
cess. When the target is met, the encoding simply stops. 
Similarly, given a bit stream, the decoder can cease de- 
coding al any point and can produce reconstructions cor- 
responding to aU lower- rate encodings. 

Embedded coding is similar in spirit to binary finite* 
precision representations of real numbers. AU real num- 
bers can be represented by a String of binary digits. For 
each digit added to the right, more precision is added. 
Yei T the "encoding" can cease at any time and provide 
the "'best" representation of the real number achievable 
within the framework of the binary digit representation. 
Similarly, the embedded coder can ceafie at apy time and 
provide the "best** representation of an. image achievable 
within its framework; - 

The embedded coding scheme presented here was mo- 
tivated in part by universal coding schemes that have been 
used for lossless data compression in which the coder at- 
tempts to optimally encode a source using no prior knowl- 
edge of the source. An excellent review of universal cod- 
ing can be found m [3]. In universal coders, the encoder 
must learn the source statistics as it progresses. In other 
words, the source model is incorporated huo the actual bit 
stream. For lossy compression, there has been Hide work 
in universal coding. Typical image coders require exien- 
Sivc training for both quantization (both scalar and vector) 
and generation of nonadaptjve entropy codes, Such as 
Huffman codes. The embedded coder described in this pa- 
per attempts U) be universal by incorporating all learning 
into the bit stream itself. This is accomplished by the ex- 
clusive use of adaptive arithmetic coding. 

Intuitively, for a given rate or distortion, a nonembed- 
ded code should be more efficient than an embedded code, 
since it \g free from the constraints imposed by embed- 
ding. In their theoreticaj work I9] r Eqiaitz and Cover 
proved chat s successively refutable description can- only 
be optimal if the source possesses certain Mackovian 
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opdjmaliry is never claimed, a 
cj ibedded bit stream with do ap- 
qjpality has been developed. 



B. FeOfures of ihe Embed* ed Coder 

The EZW algorithm cor tains the foil owing features 

* A discrete wavelet In isfarm which provides a Com* 
pact multircsolution r presentation of the image. 

* Zeroirce coding which provides. & compact muUi- 
resolution rcprcsentati >n of significance maps, which 
are binary maps indk ujng the positions of (be sig- 
nificant coefficients. '. LeroLrees allow the successful 
prediction of insignif iant coefficients across scales 
to be efficiently repre ented as pan of exponentially 
growing trees. 

* Successive Approxirr uion which provides a com- 
pact mubiprtcision n [ttesexrtation of the significant 
coefficients and £acilii lies the embedding algorithm. 

- A prioritization protoc al whereby lhc ordering of im- 
po nance is determine 1, in order, by the precision, 
magnitude , Scale, and spatial location of the wavelet 
coefficients. Note in particular, that larger coeffi- 
cients are deemed mo jc important than smaller coef- 
ficients regardless of heir scale. 

* Adaptive multilevel rithmetie coding which pro- 
vides a fast an*) effici m method for entropy coding 
strings of symbols, aj d requires no training or pre- 
sided tables. The ari h me tic coder used in the ex- 
perimenLs is a custom red version of that in [31]. 

* The algorithm runs se }uentially and stops whenever 
a target bit rate or a ea gel distortion is met. A target 
hit rate can be met ex \cr\y. and an operalionaj raic- 
vs .-distortion function (RDF) can be computed po rat- 
by -pom I. 



ho v 



C Paper Organization 
Section II discusses 
resolution analysis pro v id- 
representing 1 'trends" * and 
equal footing. This is important 
cause edges, which can be 
spatial domain, represent 
(ion despite that fad thai 
liny fraction of the image 
the concept of a zeroirce 
can efficiently encode a &ig 
5cicnts by predicting the 
tion across scales, ScctionJlV 
approximation qtiaslizatioi 
zeroirce coding, and ariihmfci; 
embedded coding. A 
by which EZW attempts u 
ponartcc. A key point il 
porta nee for the purpose 
on the magnitudes of the un 
the viewpoint of what the 



& id \ 



, disci is ion , 



wavelet theory and multi- 
an elcganl methodology for 
anomalies" on a statistically 
in image processing be* 
nought of as anomalies in the 
Xtremely important uifoima- 
bey are represented in only a 
spmplcs. Section HI introduces 
shows how zerotrec coding 
ificance map of wavelet cocf- 
absence of significant informa- 
dlscusses how successive 
is used in conjunction with 
ic coding to achieve efficient 
follows on the protocol 
order the bits in order of im- 
is that the definition of inv 
of|ordcr]ng information is based 
enainiy intervals as seen from 
ecoder can figure out. Thus, 



there is no additional Overhead to transmit Ibis ordering 
information. Section V consists of a simple 8x8 ex- 
ample illustrating the Various points of the EZW algo- 
rithm. Section VI discusses experimental results for var- 
ious rates and for various standard test images. A 
surprising result is that using the EZW algorithm. Lami- 
nating the encoding at an arbitrary point in the encoding 
process does no I produce any artifacts thai would indicate 
where in the picture the termination occurs. The paper 
concludes with Section VU. 

II. Wavelet Theory and M o ltireso lution 
Am a lysis 

A. Trends and Anomalies 

One of the oldest problems in statistics and signal pro- 
cessing is how to choose the size of an analysis window, 
block size, or record length of data so that statistics com* 
putcd within that window provide good models of the sig- 
nal behavior within that window. The choice of an anal- 
ysis window involves trading the ability to analyze 
"anomalies,*' 1 or signal behavior that is more localized in 
the time dt space domain and tends to be wide band in the 
frequency domain, from "trends/* or signal behavior that 
is more localized in frequency but persists over a huge 
number of lags in ihe tunc domain. To model data as being 
generated by random processes so that computed statistics 
become meaningful, stationary and ergodic assumptions 
are usually required which lend to obscure the contribu- 
tion of anomalies. 

The main contribution of wavelet theory and mulurcs- 
olution analysis is that it provides an elegant framework 
in which both anomalies and Trends can be analyzed on 
an equal footing. Wavelets provide a signal representation 
in which some of the coefficients rep resent long data lags 
corresponding to a narrow band, low frequency range, and 
some of the coefficients represent short data lags corre- 
sponding to a wide band, high frequency range. Using the 
concept of scale, data representing a continuous tradeoff 1 
between lime (or space in the case of images) and fre- 
quency is available. 

For an introduction to the theory behind wavelets and 
multj resolution analysis, the reader is referred to several 
excellent tutorials on the subject [6], \7], [17], 118], f2o], 
126). \21]- 

B. Relevance 10 Image Coding 

In image processing, most of the image area typically 
represents spatial 4 Trends/* or areas of high statistical 
spatial correlation. However "anomalies/* such as edges 
or object boundaries, take on a perceptual significance ihzt 
is far greater than their numerical energy contribution to 
an image. Traditional transform coders,, such as those us* 
ing the DOT, decompose images into a represcmaUon in 
which each coefficient corresponds to a fixed size spatial 
area and a fixed frequency bandwidth, where the band- 
width and spatial area arc effectively the same for b11 coef- 
ficients in the representation. Edge infonnarkra tends to 
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disperse so ihai many noi -zero coefficients are required 
io represent edges with go >d fidelity. However, since ihe 
edges represent relatively insignificant energy wUh re- 
spect io the entire image traditional transform code re, 
such as those using the t> tT t have beco fairly successful 
at medium and high bit ra cs. Al extremely low bit rates* 
however, traditional trans arm coding techniques . such as 
JPEG [30], tend to aUocat ioo many bits to the "trends," 
and have few bite left ovt co represent "anomalies." As 
a result, blocking artifact! often result. 

Wavelet techniques she * promise al extremely low bit 
raxes because trends, szk italics, and information at all 
"scales" in between are ivailablc A major difficulty is 
ihnf fine detail coefficient : representing possible anoma- 
lies constitute ihe Jargc: , number of coeffictcnLs, and 
thereto re, io make effee ve use of the mulii resolution 
representation, much of be information is contained in 
representing the position >f those few coefficients corre- 
sponding to significant at imalies. 

The techniques of this ] aper allow coders to effectively 
use tbc power of muli/re oluiion representations by eft- 
ciendy representing the osilions of the wavelet coeffi- 
cients representing signifi :ani anomalies. 

C. Discrete Wavelet 7 \tmsform 

The discrete wavelet ransfbrrn used in this paper id 
identical to a hierarchical Bubband system, where the sub- 
bands are logarithmically spaced in frequency and repre- 
sent an octave* band deco nposiikm. To begin the decom- 
vided Into four subbands and 
>wn in Fig. I. Each coefficient 
rresporjdmg to approximately 
image. The low frequencies 
ixixnaxery corresponding to 0 
high frequencies represent ihe 
, The four subbands arise from 
rtical and horizontal filter*. The 
and HH% represent the finest 
To obtain the neat coarser scale 
subband LL\ is further decern- 
ed as shown in Fig. 2. The pro- 
Jtnar scale is reached. Note that 
coefficients represent a larger 
rat a narrower band of frcqaca- 
are three subbands; the remain- 
ing lowest frequency su >hahd is a representation of the 
information al all coarse r scales. The issues involved in 
the design of the fillers for the type of subband decom- 
position described abov have been discussed by many 
authors and are not treat rd in this paper. Interested read- 
en Should consult [11. f ». T32). [351 . in addition to ref- 
erences found in the bib iographies of the tutorial papers 
cited above. 

It is a mailer of lenn nology io distinguish between a 
transform and a subfaam system as rhcy are two ways of 
describing the same set f numerical operations from dif- 
fering points of view. 1 et jr be a column vector whose 
elements represent a srdhning of the image pixels, let X 



position, the image is 
critically subsamplcd as 
represents a spatial area 
a 2 x 2 area of the origi 
represent a bandwidth a] 
< |u| < a-/2, whereas 
band from x/2 < ju| < 
separable application of 
subbands labeled LH ti 
scale wavelet coetnc'n 
of wavelet coefficients, 
posed and critically sam; 
cess continues until 
for each coarser scale, 
spatial area of ihe imag< 
cies, Al each scale, the: 



III 


HI, 




nut 



Fi*. I , Fi«i aas* ^ a di«re« UwuJomi: Ta* image is diyidwl 

'noi rbcrsubburfa «rflns «epBimble ftteii. Each coefficient nprcacntt » 51*- 
U*] vtA Domapandirifi 10 ippooxInuiEly ■ 2 X2«iarite arigj**] pic- 
low. The k»v frequencies irprescni ■ t»w3widU\ njpraxinwtclj torn* 
spondinc l» 0 <: H < Tp- -Rtre»a l*« bijb frr<j^ocic» txr?rrxc** ihe 
biMj tmm*/2 < |u( < t. The foarwtbiodS arise from «ep»nble mpjia- 
calion of *CJliCaI »ed horizontal filters. 





JO* 


JJLi 


LJii 









a iwo-icalc raelei ^ttomfiauCaa- The image ii divided imto four 
ling iCfwftW* nUo^E^m^itlltfBakcxubbaMfalX^.L^j, 
HLj 9*4 MJi} R7CE9CD13 » l?*ti*l m cmTDpood"u« I* *PP toxloaieiy a 4 
X 4 ana or <fcc ori$i"*J pknnie. Tbe 1w frtoueocics »l Ibia ifialr TCpWAXA 
& bMtdwkhh >ppmximBl£ly companding in 0 < («*[ -C whei«i»tbe 
high riEOBCPtiWt Kfsesem ihe bsnd Tftm -r/4 < |&»| < 

be a colunin vector whose elements are the array of coef- 
ficients resulting from the wavelet transform or subband 
decomposition applied to jr. From the Transform point of 
view, X represents a linear transformation of x repre- 
sented by the matrix W, i.e. , 

X = Wa\ (1) 

Although not actually computed this way, the effective 
Mters that generate the subband signals from the original 
signal Torm basis functions for the transformation. i.e. , 
the rows of VK Different coefficients in the same subband 
represent Ok projection of the entire image OtttO translates 
of a prototype subband filter, since from the subband point 
of view, they are simply regularly spaced different onmuts 
of a convolution between the image and a subband filter. 
Thus, the basis functions for each coefficient in a given 
subband are simply translates of one another. 

In subband coding systems [32], ihe coefficients from 
a gi*cn subband are usually grouped together for the pur- 
poses of designing quantizers and coders- Such a group- 
ing suggests thai statistics computed from a subband are 
in some sense representative of the samples in that sub- 
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band. However ihis stati tical grouping once again im- 



OUlliers, which lend to repre- £g. h*— ** 



plicitly de-emphasizes the 

seat the most significant ai wnalies or edges. In this paper, 
ihc term M wavelet iransfoj br is used because each wave- 
lei coefficient is individu* Lly and dciernunktic^lly eom- 
pared 10 the same set of thresholds for the purpose of 
measuring significance. T tus, each coefficient is treated 
Ljportaut piece of data regard- 
less of its scale v and no su tistics for a whole subfaand are 
used in any form. The res ill is thai the small number of 
"aeterramistically" signif cam fine scale coefficients are 
not obscured because of 0* ir "statistical" insignificance. 
The fillers used to CCmj ite the discrete wavelet trans- 
form in the coding cxpermi aits described in this paper are 
based on the 9- Up symij :lric quadrature mirror filters 
(QMF) whose coefficients ire given in 1 1]. This transfor- 
mation has also been called] a QMF-pyramid. These filters 
tion to their good localization 
Hows for simple edge treat- 
' results empirically. Addi- 
tied coefficients, the transfer- 
wavelet transform obtained 
lo unitary that it can be 
rpose of lossy compression. 



were chosen because 
properties, their sy: 
ments, and they produce 
UO Daily, Using properly 
mation matrix for a di: 
using these fillers is so 
treated as unitary for the 



Since unitary trans forms reserve norms, it makes 
sense from a numerical sti id point to compare aO of the 
resulting transform coefficients to the same thresholds lo 
assess significance. 

HI. ZeRQTREES OP \ 'AVELET COEFFICICMTS 

Tn this section, an impor mi aspect of low bit rate im- 
age coding is discussed: tie coding of the positions of 
those coefficients that will mc transmitted as nonzero val- 
ves. Using scalar qmuxtrzat on followed by entropy cod- 
ing, in order to achieve ver t low bit rales, i.e., less than 
1 bit/pel. the probability o the mosi likely symbol after 
quantization— the zero sym ol— must be extremely high. 
Typically , a large fraction c the bit budget must be spent 
on encoding the significant mnp, or the binary decision 
as to whether a sample, in i is case a coefficient of a 2-D 
discrete wavelet transform, has a zero or nonzero quan- 
tized value. It follows that i significant improvement in 
encoding the significance sap translates into a corre- 
sponding gaih in compress* n efficiency. 

A. Significance Map Encod 

To appreciate tbc import nee of significance map en- 
coding, consider a typical tr nsfonn coding system where 



a d (correlating trans format] 
coded scalar quantizer. Thi 
intended to be a rigorous 
encoding, but merely to pi 
of the relative coding 



contained in the significant r map ret alive to amplitude 
and sign information. 

A typical low-bit rale iraa, e coder has three basic com- 
ponents; a transformation, a quantizer and data compres- 



is followed by an cmropy- 
fotlowing discussion is not 
fi cation for significance map 
ide die reader with a sense 
of the position information 




A generic transform coder. 



sion, as shown in Fig. 3. The original image is passed 
through some transformation to produce transform coef- 
ficients. This transformation is considered to be lossless, 
although in practice this may not be the case exactly. Tbc 
rransform coefficients are then quantized to produce a 
Stream of symbols, each of which corresponds lO an index 
of a particular quantization bin. Note that virtually an of 
the information loss occurs in the quantization stage. The 
data compression stage takes the stream of symbols and 
attempts to losslessly represent the data stream as effi- 
ciently as possible. 

The goal of the transformation is to produce coefficients 
that are deconelaxed. If we could, we would ideal)y like 
a transformation to remove all dependencies between 
samples. Assume for the moment thai the transformation 
is doing its job so well thai the resulting transform coef- 
ficients are not merely UbCorrclaled, but statistically ja~ 
dependent. Also, assume thaL we have removed the mean 
and coded ft separately so that the transform coefficients 
can be modeled as zero-mean, independent, although per- 
haps not identically distributed random variables. Fur- 
thermore, we might additionally constrain the model so 
that the probability density functions (PDF) for the eoef- 
fid ems are symmetric. 

The goal is to quantize the transform coefficients so that 
the entropy of the resulting distribution of bin indexes is 
small enough SO that the symbols can be entropy- Coded at 
some target low bit late, say for example 0.5 bits per pixel 
0>pp.)- Assume thai the quantizers will be symmetric 
midiread, perhaps nonuniform, quantizers, although dif- 
ferent symmetric midtread quantizers may be used tor dif- 
ferent groups of transform coefficients. Letting the Central 
bin be index 0, note that because of the symmetry, for a 
bin with a nonzero index magnitude, a positive or nega- 
tive index is equally likely. In other words, foT each non- 
zero index encoded, the entropy code is going to require 
at least one-bit for the sign. An entropy code can be de- 
signed based on modeling probabilities of bin indices as 
the fraction of Coefficients in which tbc absolute value of 
a particular bin index occurs. Using this simple model, 
and assuming that the resulting symbols are independent, 
the entropy of the symbols H can be expressed as 

R = -p loga p - (1 - p) toga (I - p) 

+ (I -P)D + C2) 

where p is the probability that a transform coefficient is 
quantized to zero, and H HZ represents the conditional en- 
tropy of the absolntc values of the quantized coefficient* 
conditioned on them being nonzero. The first two terms 
in the sum represent the first M5rder binary entropy of the 
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bob S3 follows: 



Returning to the model* 
0.5. What is the minfrnun 
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ihe third term represents the 
distribution of nonzero values 
of thon being nonzero. Thus,, 
: of encoding the actual sym- 



Total Cost = Co: of Significance Map 

< 4»l of Nonzero Values. £3) 



, oppose that the target is H — 
probability of zero achievable? 
Consider ibe case where re only use a 3-lcvel quantizer* 
Hnz — Solving fdjr p provides a lower bound on 
the probability of zero given the independence assump- 
tion 



In this case, under the 
the coefficients must be 
83 % of the bit budget is 
map. Consider a more t; 
the minimum probability | 



\H = 0.5) 0.916. 



(4) 



si ideal conditions, 91.6% of 
jantized to zero. Furthermore, 
I in encoding the significance 
rieal example where B MZ — 4, 
bf zero is 



PaiJPtz = 4|/7 = 0.5) = 0.954. 



(5) 



In this case, the probabil 
the cost of encoding the 
the cast. 

As the target rate deci 
creases, and Lhe fraction 
io the significance map 
pCndcncc assumption is 
are often additional depeiv 
can be exploited io fuithei 
Significance map. Kevi 
ma Oct how optimal 
coder, under very typj 
mining the positions 
represents a sigmfieani 
rates, and is lilcely to 
total cost as the rate dec 
ploying an image model 
and easy to Satisfy hypoi 
significance maps of w B v {let 



B. Compression of S*gn\ 
Wavetel Cocfficierus 

To improve the cotnpi 
waveleL coefficients, a 
IS defined. A wavelet 
cam with respect to a 
zcrotree is based on the 
fie tent at a coarse scale 
given threshold T, then i 
orientation in the same 
likely to be insignificant 
deoce suggests that this 

More specifically, in 
with ihe exception of t 



Of zero must increase, while 
ignifieance map is still 54% of 

ises, the probability of zero in- 
f ihe encoding cost attributed 
Of course, the indc- 
istk and in practice, there 
Dries between coefficients thai 
reduce the cost of encoding the 
, the conclusion is that no 
orm. quantizer or entropy 
conditions, the cost of dexcr- 
few significant coefficients 
rtiOD of the bit budget at low 
an increasing fraction of the 
:es. As will be seen, by em- 
ed on an extremely simple 
is, we can efficiently encode 
coefficients. 

! Maps using Zerotrees of 



sion of significance maps of 
data Structure called a ttroirt€ 

dent J is said to be insignifi- 
threshold T if \z\ < T. The 
jpothesis that if a wavelet cOef- 
in significant with respect to a 

'avelet coefficients of the same 
tial location at finer scales are 
ith respect to 7. Empirical cvi- 
pothesis is often true, 
hierarchical subband system, 

highest frequency subbands. 



every coefficient at a given scale can be related to a Set of 
coefficients at the next finer scale of similar orientation. 
The coefficient at the coarse scale is called the parent* and 
all coefficients corresponding to the same spatial location 
at the next finer scale of similar orientation are called chil- 
dren* For a given parent, the set of all coefficients at all 
finer scales of similar orientation corresponding to the 
same location are called descendants. Similarly, for a 
given child, the set of coefficients at all coarser scales of 
similar orientation corresponding to the same location are 
called ancestors. For a QMF*p>ranud subband decom- 
position, the parent-child dependencies arc shown j& Fig. 
4. A wavelet tree descending from a coefficient in sub- 
band HH3 is also seen in Fig. 4, With the exception of 
the lowest freijncncy subband. all parents have four chil- 
dren- For the lowest frequency subband, the parent -child 
relationship is defined such that each parent node h*s thru 

children. 

A scanning of the coefficients « performed in Such a 
way that no child node is scanned before its parent. For 
an /V-SCale transform, the Scan begins al lhe lowest fre- 
quency subband k denoted as LL M , and scans subfaands 
HLfi, JUH Nl and HH^ at which point it moves on to scale 
N — 1, etc. The scanning panern for a 3-scale QMF-pyr- 
amid can be seen in Fig. 5. Note that each coefficient 
within a given subband is scanned before any coefficient 
in the next subband. 

Given a threshold level 7 to determine whether or not 
a coefficient is significant, a coefficient x i& said to be »a 
element of a zerotree tor threshold 7* if rtsdf and aU of its 
de seen dents arc insignificant with respect to T. An ele- 
ment of a zeroiree for threshold T is a zeros re e root if it 
U not the descendant of a previously found zeroiree root 
for threshold T, i.e.. it is not predictably insignificant 
from ihe discovery of a zero tree root at a coarser scale at 
the same threshold. A ZetOlrec root is encoded with a spe- 
cial symbol indicating chat the insignificance of the coef- 
ficients at finer scales is completely predictable. The sig- 
nificance map can be efficiently represented as a suing of 
symbols from a 3 -symbol alphabet which is then entropy* 
coded. The three symbols used are J) zcrotree root, 2) 
isolated zero, which means that the coefficient is insignif- 
icant but has some significant descendant, and 3) signifi- 
cant. When encoding the finest scale coefficients, since 
coefficients have no children, the symbols in ihe string 
come from a 2 'Symbol alphabet, whereby the zeruxree 
symbol is not used. 

As will be seen in Section JV, in addition to encoding 
lhe significance map, it is useful to encode the sign of 
significant coefficients along with the significance map. 
Thus, in practice, four symbols arc used; 1) zejotree root, 
2) isolated zero, 3) positive significant, and 4) negative 
Significant. This minor addition w3l be ttscfol Tor embed- 
ding. The flow chart fox the decisions made at each coef- 
ficient are shown in Fig. 6. 

Note that K is also possible to include two additional 
symbols such as *' positive/ negative significant, but des- 
cendants are zerotrees" etc. In practice, it was found mat 
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•endcnis of a ainjic coefficient is 
b a SCiotn root if K b Lcsiguifkaii 



tubbinds: Nae m*i Use arrow pomv 
jubWd of Ihc GfaOdien. The lowest 
the highest frequeocy nibbud is 91 
- welei iree coftihlicg of of rfcc dc- 
*X>*a4HH3. The cocmcknl in */T3 
And all Cf ils deice^dula arc iuit£- 



Fifi. 3. Scanorag ovtfcf of (he 
Note patents mxtn be fcan>*cd 
litions in ■ fives subbud >jc kiuic I 
sttbfaiMd, 



nbbaj )i 
*d be c 
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l« for encoding a *j£Riflc««cr map: 
ore children. Also note iKm ail po- 
beferc {he scan movci io Uk oexi 



at low bil rales, this addilioi often increases the cost of 
coding fee significance map; To see why this may occar. 
consider that there is a cost Lssociaicd with partitioning 
ihc set of positive (or negati jx) significant samples into 
those whose desceiidcnt* arc ; erolrees and those with sig- 
nificant descendants. If (he ct si of this decision is Cbiis, 
but the cost of encoding a ze otree is less than C/4 bits! 
fhen h is mora efficient 10 cod : four zcrotrce symbols sep- 
arately than to use additional jkymbols. 

Zcroiree coding reduces t± : cost of encoding the sig- 
nificance map using self-simi srily. Even though the im- 
age has been transformed usii |g a decorrclating transform 
the occurrences of insigmfica it coefficients are hot inde- 
pendent events. More tradiii -innl techniques employing 
transform coding typically 
some fbrrn of run- length enc 
Iree symbol, which is a SingL 
applies to all cree-depths. rui 
symbol Tor each ru a- length 
technique that is closer in spL 
of-block (EOB) symbol used 
a •"lerminating" symbol 



«dc the binary map via 
ling (30J. Unlike the zero- 
'termmating'* symbol and 
length encoding requires a 
[hich much be encoded. A 
to the zerotFees is the end- 
JPEC T30L which is also 
[eating that all remaining 



B „ rf . •» • » £3 *"# ' W 141111-11 

DCT coefficients in the bloc! are quantized to zero. To 




0*W Code Cede Code 

SjnabcJ SyodHd Sjabd Symrfwl 

Fi». *. Row chan for encoding a coefficient of the lifnificarcc nap. 

See why zero trees may provide an advantage over EOS 
symbols, consider* chat a zcroiree represents the insignif- 
icance information io a given oriedtation over an approx- 
imately square spatial area at ail finer scales op to and 
including the scale of the zerotree root. Because the wave- 
let transform is a hierarchical representation* Varying the 
scale in which a zcroiree root occurs automatically adapts 
the spatial area over which insignificance is represented. 
The EOB symbol, however, always represents insignif- 
icance over the same spatial area, although the number of 
frequency bands within that spatial area varies. Given a 
fixed block size, such as 8 X ft, there is exactly one scale 
in the wavelet transform in which if a zerOlree root is 
found at that scale, it corresponds to the same spatial area 
as a block of the DCT. If a zerotree root can be identified 
at a coarser scale, then xbe insignificance pertaining to 
thai orientation can be predicted over a larger area. Sim- 
ilarly, if the zcrotree root does not occur be ihis scale* then 
looking for zero trees at finer scales represents a hierar- 
chical divide and conquer approach to searching for one 
or more smaller areas of insignificance over the same spa- 
tial regions as the DCT block size. Thus, many more coef- 
ficients can be predicted in smooth areas -where a root typ- 
ically occurs at a COarSc scale. Furthermore, the zerotree 
approach can isolate inieresling non-zero details by im- 
mediately duninating large insignificant regions from 
consideration. 

Note that this technique is quite different from previous 
attempts to exploit self-similarity in image coding T19] in 
that it is far easier to predict insignificance than to predict 
significant detail across scales. The zerotrec approach was 
developed in recognition of the difficulty in achieving 
meaningful bil rate reductions for significant coefficients 
via additional pjedicrion. Instead , the focus here is on re- 
ducing ihe cost of encoding the significance map so thai, 
for a given bit budget, more bits are available to encode 
expensive significant coefficients. In practice, a large 
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A similar technique 
Knowlcs (LK) (IS], [16 



fraction of the insigniiic nt coefficients «rc efficiently en- 
coded as pair of a zerotij :c. 

has been used by Lewis and 
In that work, a "tree" U said 
to be zero if i» energy Ik less than a perceptually based 
threshold. Also, the "z* *o teg* 1 used to encode the iree 
is not entropy-coded. TY : present work represents an im- 
provement thai allows f< r embedded coding for two rea- 
sons. Applying a dctermj lislic threshold co determine sig- 
nificance results in a re otree symbol which guarantees 
(hat no descendant of th< root has a magnitude larger than 
the threshold. As a result there is no possibility of a sig- 
nificant coefficient bcingjj obscured by a statistical energy 
measure. Furthermore, tl e zerotrce symbol developed in 
this paper is pari of am Jphabet for entropy coding die 
significance map which f irther improves its compression. 
As will be discussed sub cqucntly, it is the firs I property 
that makes this method if encoding a significance map 
useful in exjunction m , succ^sive-approximatioii. Re- 
cently, a promising techr L^uc represe ruing a compromise 
between the EZW algorithm add the LK coder has been 
presented in [34). 



C. Interpretation as a Si 
The basic hypothesis— 



nj>U Image Model 

fa coefficient at a coarse scale 
is insignificant with resp ct to a threshold then all of its 
descendants, as defined a ovc, sue also insignificant— can 
be imerprcicd as an extrc bcly general image model. One 
of the aspects thai seams to be common co most models 
used to describe images k that of a "decaying spectrum." 
For example, this proper y exists for both stationary au- 
d non-stationary fractal, or 
implied by the name which re- 
fers to a generalized spc nrum p3j. The model for ihe 
zerotrec hypothesis is evqji more general than "decaying 



I examp* 



spectrum" in thai it alio 
cay ing spectrum' * becaus 
old. Consider an 
we are considering a 
whose largest descend: 
though a higher 
mtnde (40) than the 
i.e., the "decaying speci 
coefficient under considci 
ing a zerotrec root since 
(magnitude less than 
mon image models 
pothesis should be saiii 
For those instances 
safe to say that an inj 
has occurred, and we 
ing this event to be com: 
lion 

It should also he poi 
encoding significance 



1 50). 
have 



i wher 



'S for some deviations to ' 'de- 
it is linked to a specific thresh- 
where the threshold is 50* and 
tfficiort of magnitude 30, and 
has a magnitude of 40. Al- 
descendant has a larger mag- 
:Iem under consideration (30), 
' hypothesis is violated, the 
[lion Can still be represented us- 
whole tree is still insignificant 
~ius, assuming the more com- 
mie validity, the zerotree hy~ 
* easily and extremely often, 
jthc hypothesis is violated, it is 
)ari\f t i.e., unexpected, event 
expect the cost of represent- 
Jcasurace with its sclf*inforrna- 



OUt that the improvement m 
provided by zero-trees is spe- 
cifically net the result of \ xpj oiling any linear dependen- 
cies between coefficienxs < f different scales that were not 



removed in the transform stage. In practice, the linear 
correlation between the Values of parent and child wavelet 
coefficients has been found to be extremely small, imply- 
ing that the wavelet transform is doing an exceUeni job of 
producing nearly uncorrelated coefficients. However, 
there is likely additional dependency between the squares 
(cr magnitudes) of parents and cllildrcn. Experiments run 
on about 30 images of all different types, show thai the 
correlation coefficient between the square of a ChDd and 
the square of its parent tends to be between 0,2 and 0.6 
with a string concentration around 0.35, Although this de- 
pendency is difficult to characterize in general for most 
images „ even without access to specific statistics, ii is rea- 
sonable to expect the 'magnitude of a child to be smaller 
than the magnitude of its parent. In other words* it can be 
reasonably conjectured based on experience with reai- 
world images, that had we known the details of the sxa- 
tisrteal dependencies, and computed a* "optimal" esti- 
mate, such as the conditional expectation of the child's 
magnitude given the parent's magnitude, that the '"opti- 
mal" estimator would, with very high probability, predict 
thai the child's magnitude would be the smaller of the 
two. Using only this mild assumption* based on an inex- 
act statistical characterization, given a fixed threshold, and 
conditioned on the knowledge xbata parent is insignificant 
with respect to the threshold, the "optimal" estimate of 
the significance of the rest of the descending wavelet tree 
is that it is entirely insignifiraot with respect to the Same 
threshold, i.e., a zerotrec. On the other band, if the parent 
is significant, ihe "optimal'' estimate of the significance 
of descendants is highly dependent on the details of ihe 
estimator whose knowledge would require more detailed 
mforrfiarion about die statistical nature of the image. Thus, 
under this mild assumption, using zero trees co predict die 
insignificance of wavelet coefficients at fine scales given 
the insignificance of a root at a coarse scale is more likely 
to be successful in the absence of additional in Formation 
than attempting to predict significant detail across scales. 

This argument can be made more concrete. Let x be a 
child of >, where x and y arc zero-mean random variables, 
whose probability density functions (PDF) are rclaicd as 

/>,W = cp,{ax). a > 1. (6) 

This states that random variables x and y have the same 
PDF shape, and that 



Assume further that x and y are ancorrelatcd. i.e. 



^Uy] = o. 



(S) 



Note that nothing has been said about treating the sob- 
bands as a group, or as stationary random processes „ only 
thai there is a similariry relationship between random 
variables of parents and children. It is also reasonable be- 
cause for intermediate subheads a coefficient thai is 8 child 
with respect to one Coefficient is a parent with respect U> 
others; the PDF of that coefficient should be the same in 
either case. Lei u = jc 2 and v = y 2 . Suppose that u and 
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v arc corrected with cor elation coefficient p. We have 
the following relationship : 



£M = - 

JEW = 



Notice in particular thai 



(9) 
(10) 
(11) 
(12) 

(13) 

Using a well known resii l, the expression for the best 
linear unbiased estimator ffBLUE) of h given v \o mini- 
mize error variance is givdn by 



I - 



If it is observed thai the m gnitude of The parent is below 



the threshold T, i.e., v m 
be upper bounded by 



y 2 < T 2 , then the PVUE can 



- P~ tEM - v) 



V 



(M) 



(15) 



1 - P » T l 



(16) 



Consider two eases a) 7 S tr^ and b) T < <v In case (a), 
we have 



r 2 ) - ^ < t\ 
0 



07) 



which implies lh» the BLtiE of jc 1 given | >| < 7 is less 
than r a , for ony p, wcludAg p = 0. In case (b), we can 
only upper bound the righjj hand Side of (16) by rf p 
exceeds the lower bound 



(IS) 



Of course, a better nonlft>e?r estimate might yield dif- 
ferent results, but the abAe analysis suggests that for 
threshold exceeding the s^sfidard deviation of the parent, 
which by (6) exceeds the standard deviation of all de- 
scendants, if it is observe* thai a parent is insignificant 
with respect to the thresholc , then, using the above BLUE, 
the estimates for the magni udes of all descendants is that 
ihey are less than the Lhie&r. >ld, and a zeroirce is expected 
regardless of the correlatk a between squares of parents 
and squares of children. As the threshold decreases, more 
correlation is required to j isiify expecting a zerOtree to 
occur. Finally, sioce the 1c */cr bound fio — l«T-»0, 
as the threshold is reduced! it becomes increasingly dif- 

« H tlr *. »- . I 1 1 I 



ficuU to expect acrotrecs U 



of the particular Statistics ai : required io make inferences. 
The implication of this an irysis is that at very low bit 



occur, and more knowledge 



rates, where the probability of an insignificant sample 
must be high and thus, the significance threshold T must 
also be large, expecting the occurrence of ZCtutrees and 
encoding significance maps using zeroizsc coding is ira- 
sonable without even knowing the statistics. However, 
letting T decrease, there is some point below which the 
advantage of zero tree coding diminishes, and this point is 
dependent on the specific nature of higher order depen- 
dencies between parents and children. In particular, the 
stronger this dependence, the more T Can bo decreased 
while still retaining an advantage using zerotree coding. 
Once again, this argument is not intended to 14 prove" the 
opiimaJity of zerotree coding, only to suggest a rationale 
for its demonstrable success. 

D. 2*rotrt4-Uke Structures in Other Sub band 
Configurations 

The concept of predicting the insignificance of coeffi- 
cients from low frequency to high frequency information 
corresponding to the same spatial localization is a fairly 
general, concept and not specific to the wavelet transform 
configuration shown in Fig. 4. Zeroixeei are equally ap- 
plicable lo quincunx wavelets £2J. tl3], [23}, [29], in 
which case each parent would have two children instead 
of four, except for the lowest frequency, where parents 
have a single child. 

Also, a similar approach can be applied to linearly 
Spaced slkbband decompositions, such as the DCT, and to 
other more general subband decompositions, such as 
wavelet packets [5\ and Laplacian pyramids [4]. For ex- 
ample, one of many possible parent-child relationship for 
linearly spaced subbands can be seen in Pig. 7. Of course, 
with the use of linearly spaced subbands, zcrotrcc-Iike 
coding loses its ability to adapt the spatial extern of the 
insignificance prediction. Nevertheless, it is possible for 
zerotrec-iike coding to outperform EOB-coding since 
more coefficients can be predicted from the subbands 
along the diagonal. For the case of wavelet packets, the 
situation is a bit more complicated, because a wider range 
of tilings of the ' ' space -frequency * * domain are possible. 
In that ease, it may not alway* be possible to define sim- 
ilar parent -child relationships because a high-frequency 
coefficient may in fact correspond to a larger spatial area 
than a co-located lower frequency coefficient. On the other 
hand, m a coding scheme such as the "best-basis" ap- 
proach of Coifrnan et al. [SJ, had the image-dependent 
best basis resulted in such a situation, one wonders if the 
underlying hypothesis— thai magnitudes of coefficients 
tend to decay with frequency— would be reasonable any- 
way. These zero tree-like extensions represent interesting 
areas for further research. 

IV. 5uCC>SSJVE>AP1*ftOxiMATtO>? 

The previous section describes a method of encoding 
significance maps of wavelet coefficients that, at least em- 
pirically, seems to consistently produce a code with a 
lower bit rate than cither the empirical £rsw>rder entropy. 
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such mx the DO". K«fe ifaai the 
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subbntHl is *1 Ihc bottom right. 



or a run-length code of th< significance map. The original 
motivation far employing 



successive-approximation in 
conjunction with leimrec soding was that since icroircc 
coding was performing £0 
map of ihc wavelet coeffi 



we)] encoding the significance 
i«nts, it was hoped that more 
efficient coding could be ac tiieved by zerotree coding more 
significance maps. 

Another motivation forj sue ccisive-apprDxirna lion de- 
rives directly from the go J of developing an embedded 
code analogous to the bins y-rcprcsentarion of an approx- 
imation to a real number. Consider die wavelet transform 
Of an image as a mapping thereby an amplitude exists for 
each coordinate in scale-Space, The scale-space coordi- 
nate system represents af coarse-U>fine "logarithmic" 
representation of the dom; in of the function. Taking the 
coarsc-tc-fioc philosophy i ne-step further, successive-ap- 
proximation provides a coarse -io-finc, mulfiprecision 
"logarhhmjc" rtpreScnlal on of amplitude information > 
which can be thought of as he range of the image function 
when viewed in the sc&lc-s tace coordinate system defined 
by the wavelet transform, rtrus, in a very real sense, the 
E2W coder generates a it KesenLaticn of the image thai 
is coarse-to-fine jn both t e domain and range simulta- 
neously. I 

A. Succcssive-Approximat on Entropy-Coded 
Quawitaiion 

To perform the embedded coding, successive- approxi- 
mation quantization ($AQ is applied. As will be seen, 
SAQ is related to bit-plan encoding of the magnitudes. 
The SAQ Sequentially ap] lies a sequence of thresholds 
7*o* " ' * „ 7*.i to deter line significance, where (he 
thresholds are chosen so lai 7) = T._,/2. The initial 
threshold T 0 is chosen so d at |Jf/| < 2T 0 for ali transform 
coefficients ij. 

During the encoding (an I decoding), two separate lists 
of wavelet coefficients arc nainlaincd. At any point in the 
process, the dominant lis contains the coordinates of 
those coefficients that haw not yet been found to be sig- 
nificant in the same rclatW order as the initial scan. This 
scan is such that the sub* mds are ordered, and within 
each subband, the set of t efficients are ordered. Thus, 



using the ordering of the subbands shown in Fig. 5, all 
coefficients in a given subband appear on Che initial dom- 
inant list prior CO coefficients in the next subband. The 
Subordinate lis\ contains the magnitudes of those coeffi- 
cients ihai have been found to be significant. For each 
threshold, each list is scanned once. 

During a dominant pass, coefficients with coordinates 
on ihc dominant list, i.e.. those (hat have not yet been 
found to be significant, are compared to the threshold T ( 
to determine their significance, and if significant, their 
sign. This significance map is then zerotree coded using 
the method outlined in Section III. Bach time a coefficient 
is encoded as significant, (positive or negative) , ils mag- 
nitude is appended to the subordinate list, and the coeffi- 
cient in the wavelet transform army is set to zero so thai 
the Significant coefficient does hot prevent the occurrence 
of a zerotree on future dominant passes at smaller thresh- 
olds. 

A dominant pass is followed by a subordinate pass in 
which all coefficients on the subordinate list are scanned 
and the specifications of the magnitudes available to the 
decoder are refined to an additional fait of precision. More 
specifically, during a subordinate pass, the width of the 
effective quantizer step size, which defines an uncertainty 
interval for the true magnitude of the coefficient, is cut in 
half. For each magnitude on the subordinate list, this re- 
finement can be encoded using a binary alphabet with a 
"1" symbol indicating thai the true value falls in the up- 
per naif of the old uncertainty interval and a "0" symbol 
indicating the lower half. The string of symbols from this 
binary alphabet that is general cd during a subordinate pass 
is then entropy coded. Note ihat prior to this refinement, 
the width of the uncertainty region is exactly equal to the 
current threshold. After the completion of a subordinate 
pass ihe magnitudes on the subordinate list are sorted in 
decreasing magnitude, to the extent that the decoder has 
the m formation to perform the same sort. 

The process continues in alternate between dominant 
passes and subordinate passes where the threshold is 
halved before each dominant pass. (In principle one could 
divide by other factors than 2. This factor of 2 was chosen 
here because it has nice interpretations in terms of bit 
plane encoding and numerical precision in a familiar base 
2, and good coding results were obtained). 

In the decoding operation, each decoded symbol, both 
during a dominant and a Subordinate pass, refines and re- 
duces the width of ihc uncertainty interval in which the 
true value of the coefficient (or coefficients, in the case of 
a Zerotree root) may occur. The reconstruction value used 
can be anywhere in that uncertainty interval. For rmni- 
mum mean-square error distortion, one could use the cen- 
troid of the uncertainty region using some model for the 
PDF of the coefficients. However, a practical approach, 
which is used in Ihe experiments, and is also MINMAX 
optimal, is to simply use the center cf the uncertainty in- 
terval as the recansirucrion value. 

The encoding Slops when some target stopping condi- 
tion is met, such as when the bit budget is exhausted. The 
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at the end of the code that 
since a codeword has bci 
bits do not reduce the wi 
any distortion function, li 
first L bits of the bii strcai 
image as the first JL + 1 bi 
bh is insufficient to coi 
Symbol. Nevertheless, u 
embedded bii aires m ax a 
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encoding can cease at any i me and [he resulting bit stream 
contains all lower rate ei endings* Note, that if the bit 
Stream is truncated at an a; tritrary point, there may be bits 
wo not decode lo a valid symbol 
\ truncated. In that case, these 
i of an uncertainly interval of 
I fact, it is very likely that ihc 
will produce exactly the same 
; which occur* if the additional 
|>lete ihc decoding of another 
i tearing the decoding of an 
Ifcpeciflc point in the bit stream 
produces exactly the same | image that would have resulted 
had that point been the in tial target rate. This ability to 
cease encoding or decodin \ anywhere is extremely useful 
is systems that are either rate-constrained or distortfon- 
eonsuained- A side benctt of the technique is thai an op- 
erational rate vs. distortio| plot for the algorithm can be 
computed on- tine. 



B. Relationship to BH Pity* Encoding 

Although the embedded] coding system described here 
is considerably more genop) and mone sophisticated than 
simple bit-plane encoding! consideration of the relation* 
ship with bit-plane encoding provides insight into the suc- 
cess or embedded coding. 

Consider the siKcessive-bpproximation quantizer for the 
case when all thresholds a je powers of two, and all Wave- 
let coefficients are integer ,. In this case, for each coeffi- 
cient that eventually gets coded as significant, the sign 
and bit position of eke j most-significant binary digit 
(MSBD) aze measured ai d encoded during a dominant 
pass. For example! consi< er the 10-bit representation of 
the number 41 as 0000)0 001- Also, consider the binary 
digits as a sequence of bi ary decisions in a binary tree. 
Proceeding from left to ri; ht, if we have not yet encoun- 
tered a * ' I »c expect th probability distribution for the 
next digit to be strongly bi ised toward "0." The digits to 
: i*l SB D arc called the dominant 
> ing dominant passes- After the 
MSBD has been encounte red, wc expect a more random 
and much less biased dis ribation between a "0** and a 
although we mighi stiB expect p <0) > PW be- 
cause roost PDF models I jr transform coefficients decay 
with amplitude. Those b nary digits to the righi of the 
MSBD are called the sub vr&nme bits and arc measured 
and encoded doling the s< b ordinate pass. A zc roth-order 
approximation suggests th it wc should expect to pay dose 
to one bit per "binary <Ji; it" for subordinate bits, while 
dominant bits should be f r less expensive. 

By using &iece$sjve~ap mximation beginning with the 
where the probability of zero, 
and by using zerotree coding, 
as the probability of zero in- 
we should be ai le to code dominant bits with 
very few bits, since they a re most often part of a zeroirce. 
Jn general, Ihe thrcshc! Is need not be powers of two. 



largest possible ihTcshold 
is extremely close to one 
whose efficiency 



HowevcT. by factoring out a constant mmrimn T M f the 
starting threshold T 0 can be expressed in terms of a thresh- 
old that is a power of two 

7o = Af2 r , (19) 

where the exponent if is an integer, in which case, the 
dominant and subordinate bits of appropriately scaled 
wavelei coefficients are coded during dominant and sub- 
ordinate passes ( respectively. 

C. Advantage o/SmoU Alphabets for Adaptive 
Arithmetic Coding 

Note that the particular encoder alphabet used by the 
arithmetic coder ai any given time contains either 2, 3, or 
4 symbols depending whether the encoding b far a sub- 
ordinate pass, a dominant pass with no zerotrec root sym- 
bol, or a dominant pass with the zerotree root symbol. 
This is a reaJ advantage for adapting the arithmetic coder. 
Since there are never more than four symbols, all of the 
possibilities typically occur with a reasonably measurable 
frequency. This allows an adaptation algorithm with a 
short memory to learn quictty and constantly track chang- 
ing symbol probabilities. This adaptiviry accounts for 
some of the effective ness of the overall algorithm. Con- 
trast Ihis with (he case of a large alphabet, as is the case 
in algorithms that do not use successive approximation- 
Jn that easeC it takes many events before an adaptive en- 
tiopy coder can reliably estimate the probabilities of un* 
lively symbols (see the discussion of the zero-frequency ' 
problem in [3]). Furthermore* these estimates are fairly 
unreliable because images arc typically statistically non- 
stationary and local symbol probabilities change from re- 
gion to region. 

In the practical coder used in the experiments, the arith- 
metic coder is based on [31]. In arithmetic coding, the 
encoder is separate from the model, which in {31}. Is bas- 
ically a histogram. During the dominant passes, simple 
Markov ccrati Dotting is used whereby one of four histo- 
grams is chosen depending on 1) whether the previous 
coefficient in the scan is known to be significant, and 2) 
whether the parent is known to be significant. During the 
subordinate passes, a single histogram is used. Each his- 
togram entry is initialized to a count of one. Alter encod- 
ing each symbol, the corresponding histogram entry is in- 
cremented. When the sum of all Ihe counts in a histogram 
reaches the maximum count, each entry is ucrcrneeted 
and integer divided by two, as described in [31 J. ll should 
be mentioned, that for practical purposes, the coding gains 
provided by using this simple Markov rnndrnrmiriE , may 
not justify the added complexity and using a single his- 
togram strategy for the dominant pass performs almost as 
well (0. 12 dB worse for Lena at 0.25 bop-)- The choice 
of maximum histogram count is probably more critical, 
since that controls the learning rate for the adaptation. For 
the experimental results presented , a maximum count of 
256 was used, which provides an tntermediate tradeoff be- 
tween the smallest possible probability, which is the re- 
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ctprocal of the maximu n 
which is faster -with a sn\ dl 



D. Order of Importance 
Although importance 



count , and the learning rate, 
ler maximum histogram count. 



f \ of the Bits 

- ■■ - r j- * subjective zcrm. the order of 

processing used in EZWfl implicitly defines & precise or- 
dering of importance thai is lied to, in order, precision, 
magnitude, scale „ and spiuial location, as determined by 
the initial dominant list. 

The primary detennin tic-n of ordering importance is 
the numerical precision if the coefficients. This can be 
seen in the fact thai the u icertainty imervals for the mag- 
nitude of all coefficients j re refined to the same precision 
before the uncertainty in erval for any coefficient is re- 
fined further. 

The second factor in th determination of importance is 
magnitude. Importance: b; magnitude manifests itself dur- 
ing a dominant pass beca ise prior to the pass, all coefiV 
cienls arc insignificant ai d presumed to be zero. When 
they arc found to be sign ficani, they are all assumed 10 
have the Same magnitude which is greater thin the mag- 
Is thai remain insignificant. Im- 
inifests itself daring a subordi- 
|at magnitudes are refined in 
•enter of the uncertainty inter- 
i teipretation of tbe magnitude. 
The third factor, scale, manifests itself in the a prion 
ordering of the subbands 0 1 the initial dominant list. Until 
the significance of ihe mi gnitude of a coefficient is dis- 
covered during a domic* u pass, coefficients in coarse 
scales are tested for signifi »nce before coefficients In fine 
scales. This is consistent \ ith prioritization by the decod- 
er's version of magnitude it rice for all coefficients not yet 
found to be significant, th : magnitude is presumed to be 
zero. 

The final factor, spatia location . merely implies that 
two coefficients that cann * yet be distinguished by the 
decoder in terms of cither ireraion, magnitude, or scale, 
have their relative import nee determined arbitrarily by 
the initial scanning order >f the subband containing the 
two coefficients. ' 

In one sense, this embed ling strategy has a sirietly non- 
ihCreasing operational disl< rtion-ratc function for the dis- 
tortion metric defined to b : the sum of the widths of the 
uncenainry intervals of aft of ihe wavelet coefficients. 
Since a discrete wavelet i4usforra is an mvenible repre- 
sentation of an image, a diStonion function defined in the 
wavdel transform domain gs also a distortion function de- 
fined on the image. This isionion function is also not 



niludes of those cDeffitierJ 
po nance by magnitude \ 
nate pass by the fact <| 
descending order of the ■ 
vals, i.e., the decoder's : 



without a rational found; 



ion for low-bit rate coding. 



where noticeable artifacts i nisi be tolerated, and percept 
tual metrics based on just-: oticesbJe differences (JND's) 
do not always predict whic > artifacts human viewers will 
prefer. Since minimizing tl c widths of uncertainty inter- 
vals minimises the largest p issible errors, artifacts, which 
result from numerical erroi ; large enough to exceed per- 
ceptible thresholds, are mi limizcd. Even using this dis- 
tortion function, the propo cd embedding strategy is not 



optimal , because truncation of the bit stream in the middle 
of a pass causes some uncertainty intervals to be twice as 
large as others. 

Actually, as it has been described thus far, EZW is mi- 
likely to be optimal for any cHstonion function. Notice 
that in (19), dividing tbe thresholds by two simply dec- 
rements E leaving M unchanged. While there must exist 
an optimal starting M which minimizes a given distortion 
function, how to find this optimum is still an opes ques- 
tion and seems highly image dependent. Without knowl* 
edge of the optimal M and being forced to choose it based 
on some other consideration, with probability one. either 
increasing or decreasing Af would have produced an 
embedded code which has a lower distortion fox the same 
rate. Despite the fact thai without trial and error optimi- 
zation for Af, EZW is probably suboptzmal, it is never* 
thclcss quite effective in practice. 

Note also that using the width of the uncertainty inter- 
val as a distance metric is exactly the same metric used in 
finite-precision fixed-point approximations of real num- 
bers. Thus, the embedded code can be seen as an "im- 
age" generalization of finite-precision fixed-point ap- 
proximations of real numbers. 

£. Relationship to Prioriry-Posilion Coding 

In a technique based on a very similar philosophy, 
Huang et al. discusses a related approach to embedding, 
or Ordering the information in importance, called prioriry- 
posiiion coding (PPQ [10]. They prove Very elegantly 
thai the entropy of a source is equal to the average entropy 
of a particular ordering of that source plus the average 
entropy of the position information necessary to recon- 
simci the source. Applying a sequence of decreasing 
thresholds, they attempt to sort by amplitude all of the 
DCT coefficients for the entire image based on a panrtion 
of the range of amplitudes. For each coding pass, they 
transmit the significance map which is arithmetically en- 
coded. Additionally, when a significant coefficient is 
found they transmit its value to Us full precision. Like the 
EZW algorithm, PRC implicitly defines importance whh 
respect to die magnitudes of the transform cocfficienis. In 
one Sense. PPC is a generalization of the successive-ap- 
proximation method presented in this paper, because PPC 
allows more general partitions of the amplitude range of 
the transform coefficients, On the other hand, ppC 
send* the value of a significant coefficient to full preci- 
sion, its protocol assigns a greater importance to the least 
significant bit of a significant coefficient than to the iden- 
tification of new significant coefficients oa next PPC pass. 
In contrast, as a top piioriiy. EZW tries to reduce the 
width of the largest uncertainty interval in all coefficients 
before increasing the precision further. Additionally, PPC 
makes no attempt to predict insignificance from low fre- 
quency to high frequency, relying solely on the arithmetic 
coding to encode the significance map. Also unlike EZW, 
the pTObahHiry estimates needed for the arithmetic coder 
were derived via training on an image database instead of 
adapting to the image itself. It would be interesting to 



08/19/2005 FRI 01:28 [TX/RX NO 9323] @029 



20 0 5$ 8E19B miM 



RICOH LEGAL DIV. 



NO. 2817 — P. 30- 



3*26 



1EE£ TRA>3aCTTOKS ON SIGNAL PROCESSING. VOL. •* I. NO. 12. SEC EMS 131 IV» 



experiment with variations 
EZW (wavelet transforms, 
defined by a decreasing 
and adaptive arithmetic 
the more general approach 
plitudes found in PPC- Xn 
whether the -finest grain 
provides ary coding gain, 
higher computational cost 
Additionally, with the 
plitude passes, the coding 
power- of- two amplitudes 
ing that, in practice, Uiinj 
coding gain. 



vhich combine advantages Of 
aerotrec coding. importance 
of uncertainly intervals, 
using small alphabets) with 
partitioning the range of axn- 
djracticc. however, h is unclear 
on ing of the amplitude range 
ttd there is certainly a much 
associated with more passes, 
of the last few low-am- 
ij^sults reported in [10] did use 
define the partition suggest- 
finer partitioning buys little 



seqi ence 
codi rig 



1 part lii 



cxe« piion i 



V. A Sim; ue Example 
In this section, a simple example will be used to high- 
light the order of operation , used in the EZW algorithm. 
Only the string of symbols twill be shown. The reader in- 
terested in the details of ad ipUve arithmetic coding w re- 
ferred to [31]. Consider tin simple 3-SCalc wavelet trans* 
form of an 8 X 8 image. *I he array of values is shown in 
Fig. 8, Since the largest ciefficient magnitude is 63. we 
can choose our initial thres iold to be anywhere in (31 -5, 
63 1. Lei T 0 »* 32. Table 1 shows the processing on the 
first dominant pass. The fo lowing comments refer 10 Ta- 
ble I: 

1) The coefficient pas r ggnitude 63 which U greater 
than Lhe threshold 32, and i i positive so a positive symbol 
is generated. After decod ng mis symbol, lhe decoder 
knows rhe coefficient m the interval [32, 64) whose center 
13 48. 

2) Even though the coe Pictent 31 is insignificant with 
respect to the threshold 32. it has a significant descendant 
two generations down Lit ubband LH1 with magnitude 
47. Thus, the symbol for f * isolated zero is generated. 

3) The magnitude 23 ii less than 32 and all descen- 
dants which include (3, — 1 l % -14. 8) in subband HH1 and 
all coefficients in subband 1H\ are insignificant. A Zero- 
tree symbol is generated, a id no symbol will be generated 
for any coefficient in Subb nds HH1 and HH 1 during the 
current dominant pass. 

4) The magnitude 10 i: leas than 32 and all descent 
darns (-12, 7. 6, -1) also nave magnitudes less man 32. 
Thus a Zerouee symbol is generated. Notice that ibis ace 
has a Violation of the "d caying spectrum'' hypothesis 
since a coefficient (-12) in subband HLX has a magnitude 
greater than its parent (10 . Nevertheless, the entire tree 
has magnitude less than t te threshold 32 so it is still a 
lenjtrcc- 

5) The magnitude 14 is nsigmncant with respect to 32. 
Its children are (-1, 47, -3 2). Since its child with mag- 
nitude 47 U significant, & \ isolated zcto symbol is gen- 
erated. 

6) Note that no symbol were generated from subband 
HH 2 which would ordinar ly precede subband HLX in the 
scan. Also note that sine J subband HLA has no descen- 
dants, the entropy coding ean resume using a 3 -symbol 
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alphabet where, the Xt> and ZTR symbols are i 
the 2 (zero) symbol. 

7) The magnitude 47 is significant with respect to 32. 
Note that far the furore dctariaani passes, this position will 
be replaced with the value 0, so that for the next ctomraaM 
pass at threshold 16. the parent of this coefficient* which 
has magnitude 14, can be coded using a zero tree root 
symbol. 

During the first dominant pass, which used a threshold 
of 32, four significant coefficients were identified. These 
coemcients will be refined during the first subordinate 
pass. Prior to the first subordinate pass, the uncertainly 
interval for the magnitudes of all of the significant coef- 
ficient is the interval [32. 64). The first subordinate pass 
will refine these magnitudes and identify Uicm as being 
either in interval 132, 4fi), which will be encoded with the 
symbol "0," or in the interval (48, 64), which win be 
encoded with the symbol J * 1 .** Thus, the decision bound- 
ary U the magnitude 48. It is no coincidence that these 
symbols are exactly the first bit to the right of the MSBD 
in the binary representation of the magnitudes. The order 
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of operations in the first si bordinacc pass is j] lustra led in 
Table JL 

The first entry has mag litude 63 and is placed in the 
upper interval whose cea =r is 56- The next entry has 
magnitude 34, which plac s it in the lower interval. The 
third entry 49 is in the upp t interval, and the founh entry 
47 is in the lower raiervaj Note that in the case of 47, 
using the center of the urn ertainty interval as the recon- 
struction value, when the I ^construction value is changed 
from 4$ to 40, the reconst action error actually increases 
from 1 to 7. Nevertheless, he uncertainty interval for this 
coefficient decreases from width 32 to width J 6. At the 
conclusion of the process i g of the entries on the subor- 
dinate list Dorresponding E > the uncertainly interval [32, 
64), these magnitudes arc I ordered for future Subordinate 
passes in the order (63, 49. W, 47). Note that 49 is moved 
ahead of 34 because from t * decoder's point of view, the 
reconstruction values 56 ai 1 40 are distinguishable. How- 
ever, the magnitude 34 rc nains ahead of magnitude 47 
because as far as the dccoler can ten. both have magni- 
tude 40. and' the initial aider, which is based first on im- 
portance by scale, has 34 jjjrior to 47. 

to the second dominant pass 
During this pass* only those 
be Significant are Scanned. 
TtHs previously found to be 
i for the purpose of de-ermbv 
s, the second dominant pass 
efficient -31 m subband Lr/3 
□efficient 23 in subband HH3 
\ three Coefficients in subband 
£ly found to be significant 
ded as zerotree roots, as arc 
L LH1 and all four coeffi- 
; second dominant pass ter- 
| all other cocfHcients are pre- 



The process continues 
at the new threshold of \ 
coeftcienzs not yet found 
Additionally,, (hose 
significant are treated as 
big if a axrotree exists, 
consists of encoding the 
as negative significant, i 
as positive significant. 
Ml that have not been 
(10. 24. -13) are each 
all four coefficients in sul 
cients in subband HH2 
mi nates at this point sine 
dicubly imjgni&cam 

The subordinate list no' 
□iiudes (63, 49, 34, 47, 3 



contains, in order, the mag- 
23) which, prior to this sub- 
ordinate pass , i e p r e sen t thefrhrec uncertainty intervals (48, 
64), [32, 48) and (16, 31) each having equal width 16. 
The processing wul refine i tcb magnitude by creating two 
new Uncertainty intervals f ir each of the three current un- 
certainty intervals- Ar the of the second subordinate 
pass, the order of the magnitudes is {63. 49. 47. 34. 31, 
23), since at dris point, thJ decoder could b&vc identified 
34 and 47 as being in diffe cm intervals- Using the center 
of the uncertainty interval ; s the reconstruction value, the 
decoder lists the rnagnilud* s as (60, 52, 44, 36, 28, 20). <=u*i. 



The processing continues alternating between dominant 
and subordinate passes and can stop at any time. 

VI. Hxr-E&lMfXTAL Results 
All experiments were performed by encoding and de- 
coding an actual bil Stream to verify the correctness of the 
algorithm.- After a 12-byte header, the entire bil stream is 
arhbmctically encoded using a single arithmetic coder 
with an adaptive model [31]. The model is initialized ai 
each new threshold for each of the dominant and subor- 
dinate passes. From chat point, the encoder is fully adap- 
tive. Note in particular that there is no training of any 
land, and no ensemble statistics of images arc used in any 
way (unless one calls the zerotree hypothesis an ensemble 
statistic). The 12-byte header contains 1) the number of 
wavelet scales, 2) the dimensions of the unqgc, 3) the 
maximum histogram count for me models in the arith- 
metic coder. 4) the image mean and 5) the initial thresh- 
old. Note I hat after the header, there is no overhead ex- 
cept for an extra symbol for end-of-bU-Stream, which is 
always maintained at minimum probability. This axna 
symbol is not needed for storage on computer medium if 
the end of a file can be detected. 

The EZW coder was applied to the standard black and 
white 8 bpp. test images, 512 X 512 "Lena** and the 512 
x 512 "Barbara/* which are shown in Figs. 9(a) and 
11(a). Coding .results for "Lena" arc summarized In Ta- 
ble UT and Fig. 9. Six scales of the QMF-pyramid were 
used. Similar results are shown for "Barbara" in Table 
IV and Fig. 10. Additional results for the 256 X 256 
"Lena" are given in [22]. 

Quotes of FSNR for the 512 X 512 "Lena" image are 
so abundant throughout the image coding literatim: that it 
is difficult lo definitively compare these results with other 
coding results. 1 However, a literature search has only 
found two published results where authors generate an ac- 
tual brt stream that claims higher PSNR performance at 
rates between 0.25 and 1 bit/pixel [12] and £21], the lat- 
ter of which is a variation of the EZW algorithm. For the 
''Barbara** image, which is far more diffimit than 
1 "Lena/* the performance using EZW is substantially bet- 
ter, at least numerically, than the 27.82 dB for 0.534 bpp. 
reported m [28] . 

The performance of the EZW coder was also compared 
to a widely available version of JPEG [14], JPEG does 
not allow the user 10 select a target bit rate but instead 
allows the user lo choose a "Quality Factor." In the ex- 
periments shown in Fig. 11, * 'Barbara 1 ' is encoded first 
using IP EG to a file size of 12 866 bytes, or a bit rate of 
0.39 bpp. The PSNR in this case is 26.99 dB. The EZW 
encoder was then applied lo * 'Barbara** with a target file 



'Actoallj tbere axe smluple vcxzaoBt of the knniziasoe ooly *~Lrn»" 

floating tround, ind the one oscd la (22) is dtnea and sfigbuy more diffi- 
cult itaa ihc ,, «|Ti*i»l" oik cbC>incd by Ihisavlbor jVot" fcFl *ft« [223 •»» 
pUbOibftd* Also natz Vbai this shook! nol be cncfcacd wilh results uiing 
only Hie trcca component of an ROB vennm *<fcich are abo comu&uity 
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Size of exactly 12 866 
dB, sigoiiicaAily higher 

then applied to 
obtain exactly the same PI 
size U 8820 byrca, or 0, 
EZW version looks bcti 
sou. White there U 



th n 
Ba baza' 



2 7 



some loss 



fit, 9. Fcftwnu&ce of EZW Coder upmuas on "txaa." (bJ Origin! 
512 X 3 12 "Leu" tafte n bttx/piiet <b) 1 .0 tila/pixcL 6 ; 1 Compre*. 
lifin, JPSKR » 39.55 dfi. (c) 0.3 bus/pixd |fi : I Corajrciaian. PSNR - 
36 .IS. <d| 0,2? hiu/piicl, 32:1 Om^nasnrfi. P5T*R - dB. (ej 

0.0673 Drf*/pi«L 122: 1 CronpftwiOB. FSNH « *1B, (O 0.015623 

bits/pi*ci. 512: 1 Ctenene^MO. PSNR =*■ 23.63 0B. 



The resulting PSNR is 29,39 
Tot JPEG. The EZW encoder 
using a target PSNR to 
NR of 26.99. The resoKing file 
bpp. Visual fy. ihc 0,39 bpp. 
than the 0.39 bpp, J PEC ver- 
of resolution in both, there 



are noticeable blocking an i Cacti in the JPEG version. For 
ihc comparison at ibe same PSNR, one could probably 
argue in favor of the JPEG. 

Another interesting figure of merit is the cumber of sig- 
piftcant coefficients retained. DeVore et cl. used wavelet 
trans rorro coding to progressively encode the same image 
f8). Using 68 272 bits. (8534 bytes, 0.26 bpp.), they re- 
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TAB1_£ IV 

Coding Rl4ui.ts dor 312 x 512 Barbaia Siswimg Peax-Svx^to- 

Noise <f*SNR) anh t*s NuMBBl O* Wavelet CotirwnorTj THAT ~Cji1 
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4J0L3 


2134 
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dit 


ojoisASa 








cm 


3M 
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tamed 20)9 coefficients am achieved a RMS cnor of 
15,30 <MSE - 2*4. 24.42 d* ). whereas using the embed- 
ded coding scheme. 9774 ct> fficients are retained, using 
only 8192 bytes. The PSNR for these two examples dif- 
fers by over 8 dB. Pan of the difference can be attributed 
la feet that the HaaT basis wai ised in However, doser 
examination shows thai (he eerotree coding provides a 
much better way of encoding the positions or the signifi- 
cam coefficient* ihsn was usi 1 in [8]. 

An interesting and pcitu ps surprising property of 
embedded coding u that wht » ihe encoding or decoding 
is terminated during the midd t of a pas*, or in ihe middle 
of the scanning or * sub-hand there are no artifacts pro- 
duced ih*« would indicate wf ere the termination occurs. 
In other words* some coefficic ais in the same sobband m 
represented with twice the prt :ision or the others. A pos- 
sible explanation of this phenomena is that at low rates 



ihere art so few significant 
not make a perceptible diff, 
is a dominant pass, setting 



ifficients thai any one does 
ice. Thus, if the last pass 
coefficient that might be 



significant to rcto may be imperceptible. Similarly, the 
foci that some have more precision than others is also inv 
pcrceptihJe. By the time the number of significant coeffi- 
cients becomes large, the picture quality is usually so good 
thai adjacent coefficients with differem precisions are im- 
perceptible. 

AnoUier interesting property of the embedded coding is 
that because of the implicit global bit allocation* even ax 
extremely high compression ratios, the performance 
scales- At a compression ratio of 512; 1, the image qual- 
ity »f "Lena** is poor, but still recognizable. This is not 
the case with conventional block coding schemes, where 
at SOCh high compression ratios, there would be insuffi- 
cient bits lo even encode the DC coefficients of cadi 
block. 

The unavoidable artifacts produced 3* low bit rates us- 
ing this method are typical of wavcJci coding schemes 
coded to ihe same PSNR's." However, subjectively, they 
are not nearly as objectionable as the blocking effects typ- 
ical of block transform coding schemes. 
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A new technique for ir 
that produces a fully emt 
the compression perfo 
peUrjve wilh virtually alJ 
able performance can be 
lowing four features: 



Ptg. IC. Performance or EZW Coder OfmliP£ OH "Barfcrftt" u f» 1.0 
bfU/Efecf* ft: I CocapKukm, PSNR - 35.1-4 dB (b) 0.5 t>Wpi**L l« i t 
Comprvstion. PSNR * 30.33 0&. CC) 0 i25 tuu/plxel. 64:1 Compro- 
lion, PSNR ~ 24.03 dB. (d) 0.0623 Vb/tnnxK 12* : I Corapfe**Mn. PSNR 
~ 23.10 dB, 



VH. C >NCLUSfON 



gc coding has been presented 
bit stream. Furthermore , 
cc of this algorithm is corn- 
techniques. The remark- 
ribvted to the u£e of the fol- 



a discrete wavelet transform, which deconclalcs 
rnost sources fairly well, and allows the more signif- 
icant bits of precision of mosi coefficients to be ef- 
ficiently encoded as part of exponentially growing 
ZETOIrceS. 

zeroiree coding, which by predicting insignificance 
across scales using an image model lhat is easy for' 
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iail 512 X 51? (b) £2W u H fit*, bykt», OJ9 bin /pixel. 19 dB, (c) 
E2W « e«70«yM4. 0.27 tit»/pi>dt 2199 «W, f«) J FCC *t 12 B6« byt». 
0.39 bils/pbtcL 26.99 dB. 



most linages io satis] 
gains over the first- 
mips, 

succcssive-approx. 
of multiple signifii 
allows the encoding 
adaptive arithmetic 



provides substantial coding 
Srdcr entropy for significance 

on. -which allows the coding 
:e maps using 2rro trees, and 
decoding to stop at any point, 
ing. which allows the enlropy 



coder to incorporate learning into the bit stream il- 
self. 

The precise rate control that is achieved with this nl- 
gorhhrD is * distinct advamage. The user can choose a Mi 
rale and encode the image to exactly the desired bii rate. 
Furthermore, since no training of aoy kind is required. 
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the algorithm is fairly ge end a rut perforrns remarkably 
well with ciosi types of ir ages. 
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